Summarizability in OLAP and Statistical Data Bases
نویسندگان
چکیده
Summarizability of OLAP and Statistical Databases is an a extremely important property because violating this condition can lead to erroneous conclusions and decisions. In this paper we explore the conditions for summarizability. We introduce a framework for specifying precisely the context in which statistical objects are defined. We use a three step process to define normalized statistical objects. Using this framework, we identify three necessary conditions for summarizability. We provided specific tests for each of the conditions that can be verified either from semantic knowledge, or by checking the statistical database itself. We also provide the reasoning for our belief that these three summarizability conditions are sufficient as well.
منابع مشابه
A Taxonomy of Inaccurate Summaries and Their Management in OLAP Systems
Accurate summarizability is an important property in OLAP systems because inaccurate summaries can result in poor decisions. Furthermore, it is important to understand and identify the potential sources of inaccurate summaries. In this paper, we present a taxonomy of inaccurate summary factors and practical rules for handling them. We consolidate relevant terms and concepts in statistical datab...
متن کاملReasoning about Summarizability in Heterogeneous Multidimensional Schemas
In OLAP applications, data are modeled as points in a mul-tidimensional space. Dimensions themselves have structure, described by a schema and an instance; the schema is basically a directed acyclic graph of granularity levels, and the instance consists of a set of elements for each level and mappings between these elements, usually called rollup functions. Current dimension models restrict dim...
متن کاملEmpowering the OLAP Technology to Support Complex Dimension Hierarchies
Comprehensive data analysis has become indispensable in a variety of domains. OLAP (On-Line Analytical Processing) systems tend to perform poorly or even fail when applied to complex data scenarios. The restriction of the underlying multidimensional data model to admit only homogeneous and balanced dimension hierarchies is too rigid for many real-world applications and, therefore, has to be ove...
متن کاملA Novel Query-Based Approach for Addressing Summarizability Issues in XOLAP
The business intelligence and decision-support systems used in many application domains casually rely on data warehouses, which are decision-oriented data repositories modeled as multidimensional (MD) structures. MD structures help navigate data through hierarchical levels of detail. In many real-world situations, hierarchies in MD models are complex, which causes data aggregation issues, colle...
متن کاملProblèmes d'additivité dus à la présence de hiérarchies complexes dans les modèles multidimensionnels : définitions, solutions et travaux futurs
Résumé. De nos jours, les entrepôts de données et les outils d’analyse OLAP sont très utilisés dans les entreprises qui ont besoin de systèmes décisionnels qui s’adaptent à toutes les situations particulières du monde réel, pour éviter les erreurs d’analyse (plus connues dans la littérature sous le nom de problèmes d’additivité ou summarizability issues en Anglais). Dans cet article, nous prése...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1997